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(54) Method and apparatus for encoding a contour of an object 



(57) A method for encoding a video signal of a con- 
tour of an object, which is capable of reducing the 
amount of transmission data through selectively trans- 
forming the approximation errors, is disclosed. The 
method divides the contour into a multiplicity of contour 
segments, and approximates each of the contour seg- 
ments by a line segment, thereby determining positions 
of vertices on the contour. Then, it calculates a set of 
errors between the contour segment and the line seg- 
ment, wherein the set of errors is a difference between 
each contour segment and the line segment corre- 
sponding thereto, and then calculates a value for deter- 



mining the difference between the contour segment and 
the line segment. Thereafter, the method encodes and 
generates the positions of the vertices on the contour 
and the set of errors if the calculated value for determin- 
ing the difference between the contour segment and the 
line segment is larger than a threshold value TH. Alter- 
natively, if the calculated value is smaller than the 
threshold value TH, the method encodes and generates 
only the positions of the vertices on the contour, thereby 
reducing the amount of transmission data. 
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Description 

The present invention relates to a method and 
apparatus for encoding a video signal of a contour of an 
object; and, more particularly, to a method and appara- 
tus capable of reducing the amount of data by adap- 
tively encoding approximation errors. 

In digitally televised systems such as video-tele- 
phone, teleconference and high definition television 
systems, a large amount of digital data is needed to 
define each video frame signal since the video frame 
signal comprises a sequence of digital data referred to 
as pixel values. Since, however, the available frequency 
bandwidth of a conventional transmission channel is 
limited, in order to transmit the substantial amount of 
digital data therethrough, it is necessary to compress or 
reduce the volume of data through the use of various 
data compression techniques, especially, in the case of 
such low bit-rate video signal encoders as video tele- 
phone and teleconference systems. 

One of such techniques for encoding video signals 
for a low bit-rate encoding system is an object-oriented 
analysis-synthesis coding technique, wherein an input 
video image is divided into objects and three sets of 
parameters for defining the motions, the contours, and 
the pixel data of each object are processed through dif- 
ferent encoding channels. 

In processing a contour of an object, contour infor- 
mation is important for the analysis and synthesis of the 
object shape. A classical coding method for represent- 
ing the contour information is a chain coding method. 
The chain coding method, however, requires a substan- 
tial amount of bits for the representation thereof, 
although the method does not incur any loss in the con- 
tour information. 

To overcome the drawback, there have been pro- 
posed several methods to encode the contour informa- 
tion such as a polygonal approximation and a B-spline 
approximation. One of the disadvantages in the polygo- 
nal approximation is the roughness of the representa- 
tion of the contour. The B-spline approximation is, on 
the other hand, capable of representing the contour 
more precisely; however, it requires a high-order poly- 
nomial to reduce the approximation error, resulting in an 
increased overall computational complexity of the video 
encoder. 

One of the techniques introduced to ameliorate 
such problems associated with the coarse representa- 
tion of the contour and the increased computational 
complexity in the above approximation approaches is a 
contour approximation technique employing a discrete 
sine transform(DST). 

In an apparatus which adopts the contour approxi- 
mation technique based on the polygonal approxima- 
tion and the DST, as disclosed in a commonly owned 
copending application, EP-A-0 734 163, entitled "A 
CONTOUR APPROXIMATION APPARATUS FOR REP- 
RESENTING A CONTOUR OF AN OBJECT, a number 



of vertex points are determined and the contour of the 
object is approximated through the use of a polygonal 
approximation for fitting the contour by line segments. 
And, N sample points on each line segment are 

5 selected, and an approximation error at each of the N 
sample points located on each line segment is sequen- 
tially calculated in order to obtain a set of approximation 
errors for each line segment. And each of the approxi- 
mation errors represents the distance between the con- 

70 tour and the line segment at each of the N sample 
points. Thereafter, a set of DST coefficients is gener- 
ated by performing a one-dimensional DST operation 
on each set of the approximation errors. 

Even though it is possible to remedy the rough rep- 

75 reservation and computational complexity and reduce 
the volume of transmission data through the use of the 
DST-based contour approximation, it still remains desir- 
able to further reduce the volume of transmission data 
in order to efficiently implement a low-bit rate codec sys- 

20 tern having, e.g., a 64 kb/s transmission channel band- 
width. 

It is, therefore, a primary object of the present 
invention to provide an improved method and apparatus 
for encoding a video signal of a contour of an object, 
25 which is capable of reducing the amount of transmission 
data through selectively transfer rhing the approximation 
errors. 

In accordance with the present invention, there is 
provided a method for encoding a video signal of a con- 

30 tour of an object, comprising the steps of: dividing the 
contour into a multiplicity of contour segments, and 
approximating each contour segment by a line segment 
joining two end points of each contour segment, thereby 
determining positions of vertices on the contour; encod- 

35 ing the positions of the vertices on the contour ; calculat- 
ing a set of errors for each contour segment, wherein 
the set of errors represents a difference between each 
contour segment and the line segment corresponding 
thereto; coding the set of errors to provide a set of 

40 coded errors; estimating an approximation error 
between each contour segment and the line segment 
corresponding thereto; if the approximation error is 
larger than a threshold value TH, providing the set of 
coded errors and the set of encoded positions of the 

45 vertices on the contour as encoded data for the contour; 
and if the approximation error is smaller than or equal to 
the threshold value TH, providing the set of the encoded 
positions of the vertices on the contour as the encoded 
data for the contour. 

so The above and other objects and features of the 
present invention will become apparent from the follow- 
ing description of the preferred embodiments, when 
given in conjunction with the accompanying drawings, 
wherein: 

55 

Fig. 1 shows a schematic block diagram of an appa- 
ratus for encoding a contour of an object in accord- 
ance with the present invention; 
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Figs. 2A to 2D offer a polygonal approximation 
process; 

Figs. 3A and 3B provide exemplary diagrams, each 
representing errors between a line segment joining 
two vertex points and its corresponding contour 
segment; and 

Figs. 4A to 4E present exemplary diagrams setting 
forth methods for determining the difference 
between a contour segment and a line segment in 
accordance with the present invention. 

Referring to Fig. 1 , there is provided a block dia- 
gram of a contour encoder 20 for encoding a video sig- 
nal of a contour of an object in accordance with the 
preset invention. 

Contour image data is inputted to a polygonal 
approximation block 1 1 , the contour image data repre- 
senting position information of contour pixels constitut- 
ing the contour of the object. 

At the polygonal approximation block 1 1 , a plurality 
of vertices are determined on the contour through the 
use of the conventional polygonal approximation tech- 
nique. By performing the polygonal approximation, the 
contour is divided into a multiplicity of contour seg- 
ments. Each of the contour segments represents a por- 
tion of the contour made of two adjacent vertices along 
the contour and contour pixels disposed therebetween; 
and is approximated by a line segment joining the two 
adjacent vertices. The polygonal approximation block 
1 1 provides contour segment data for each of the con- 
tour segments to an error detection block 12 and a deci- 
sion block 16 via a line L10, and also provides vertex 
data to a vertex coder 17 via a line L20, wherein the 
contour segment data represents position information of 
contour pixels and vertices constituting a contour seg- 
ment and the vertex data denotes position information 
of the vertices included in the contour segment. 

Referring to Figs. 2A to 2D, there is illustrated a 
polygonal approximation process to an exemplary con- 
tour 10. 

As the contour 10 is of an open loop, two end 
points, e.g., A and B, are selected as starting vertices. 
On the other hand, if the contour to be approximated is 
of a closed loop, two farthest points on the contour are 
selected as the starting vertices. And then, the farthest 
point on the contour 10 from a line segment AB, e.g., C, 
is determined. If the distance D max from the point C to 
the line segment AB is greater than a predetermined 
threshold TH, the point C is selected as a vertex. This 
procedure is repeated until the D max for each line seg- 
ment connecting two adjacent vertices along the con- 
tour 10 becomes equal to or smaller than the 
predetermined threshold TH. 

As a result of the above procedure, the vertices, 
e.g., A to G, the contour segments, e.g.. curve AD to 
curve EB, and the line segments, e.g., AD to EB, are 
produced as shown in Fig. 2D. 

The number of vertex points depends on the prede- 



termined threshold TH. As can be seen from Figs. 2A to 
2D, the approximation of the contour 10 with line seg- 
ments becomes more similar to the contour 10 as the 
predetermined threshold TH becomes smaller, at the 

5 expense of coding efficiency. Therefore the threshold 
TH is determined under the consideration of the amount 
of data to be transmitted. 

Referring back to Fig. 1 , at the vertex coder 1 7, the 
vertex data for the contour segments is encoded by 

10 using, e.g., the conventional octant-based vertex coding 
technique or an arithmetic coding technique. The 
encoded vertex data is then provided to a multiplexor 
19. 

The error detection block 1 2, responsive to the con- 
is tour segment data for a contour segment, takes N sam- 
ple points on the line segment; and calculates an error 
at each sample point, to thereby provide a set of errors 
for each contour segment to a transform block 13 and 
the decision block 16, N being a positive integer, 

20 wherein the sample points on the line segment are equi- 
distanced from each other in a preferred embodiment of 
the invention, and the set of errors represents a differ- 
ence between each contour segment and the line seg- 
ment corresponding thereto. 

25 In accordance with the preferred embodiment of the 
invention, an error represents a displacement from a 
sample point to an intersection formed by the contour 
segment and a line drawn normal to the line segment 
from the sample point, the error including a distance 

30 between the sample point and the intersection and a 
sign indicating a relative position of the intersection with 
respect to the line segment. 

Referring to Fig. 3A or 3B, there is illustrated an 
exemplary diagram representing errors between the line 

35 segment AD or CF and its corresponding contour seg- 
ment, wherein the displacement, e.g., d1 or dV from a 
sample point S1 on the line segment AD or a sample 
point S1 • on the line segment CF to its corresponding 
intersection on the contour segment represents the 

40 error at the sample point SI or SV; and the displace- 
ments d1 to d8 at the sample points S1 to SB and dl * to 
d8' at the sample points Si * to S8* are the elements of a 
set of errors for the line segment AD and CF, respec- 
tively. 

45 Referring back to Fig. 1, the transform block 13, 
which receives the set of errors for the contour segment, 
performs one-dimensional transform to the set of errors 
from the error detection block 12 by using a predeter- 
mined transform method, e.g., discrete sine trans- 

so form(DST) or discrete cosine transform(DCT), to 
thereby produce a set of transform coefficients for each 
set of errors and provide same to a quantization block 
14. 

Consequently, the quantization block 14 quantizes 
55 the set of transform coefficients from the transform 
block 1 3 and provides a set of quantized transform coef- 
ficients corresponding thereto to a switching block 1 5. 
At the decision block 16, an approximation error 
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between each contour segment and a corresponding 
line segment thereof is estimated; and it is determined 
based on the magnitude of the estimated difference 
whether a set of errors for each contour segment is to 
be encoded and transmitted to a receiving end. 5 

Referring to Figs. 4 A to 4E, there are presented 
exemplary diagrams setting forth various methods for 
determining the approximation error between a contour 
segment CS and a line segment LS in the decision 
block 16 shown in Fig. 1, respectively. 10 

In a preferred embodiment shown in Fig. 4A, the 
difference is defined by a length L*,^ of the line seg- 
ment LS joining two vertex points V k , V k+1 , the length 
t-k.k+1 being calculated as follows: 

. 15 

wherein y k , Xfc +1 and y k+1 represent the coordinates 20 
of the two vertex points V k( V k+1 , respectively. 

As shown in Fig, 4B, the approximation error may 
be defined as a value dividing an area A^-, confined 
by the line segment LS and its corresponding contour 
segment CS by the length L k k+1 of the line segment LS. 25 

In another preferred embodiment of the invention 
shown in Fig. 4C, a value obtained the approximation 
error is defined by dividing the number of pixels P sur- 
rounded by the line segment LS joining the two vertex 
points V kl V k+1 and the contour segment CS by the 30 
length L^^. 

In Fig. 4D, the approximation is calculated by divid- 
ing a distance D from the line segment LS to the farthest 
point on the contour segment CS by the length L* k+1 . 

In Fig. 4E, the sum of the distances, e.g., E 1( E 2 , 35 
E3, E 4 , between the contour segments and the N sam- 
ple points, e.g., S 1( S2, S3, S 4 , taken on the line seg- 
ment LS divided by the length L k k+1 is defined as the 
approximation error. 

Referring back to Fig. 1 , the decision block 1 6 com- 40 
pares the approximation error obtained by one of the 
methods described with Figs 4A to 4E with a preset 
threshold TH'; and outputs to the switching block 15 a 
first selection signal if the approximation error is greater 
than TH' and a second selection signal if otherwise. The 45 
switching block 15 which receives the set of quantized 
transform coefficients from the quantization block 14 
performs a switching operation according to a selection 
signal from the decision block 16. If the switching block 
15 receives the first selection signal, it transfers the set so 
of quantized transform coefficients to a statistical coder 
18. Alternatively, if the switching block 15 receives the 
second selection signal, it intercepts the transfer of the 
set of quantized transform coefficients therethrough. 

In case the set of quantized DST coefficients is ss 
transferred to the statistical coder 18 by the switching 
block 15, the statistical coder 18 encodes the set of 
quantized transform coefficients through the use of a 



conventional statistical coding scheme, e.g., VLC(varia- 
ble length coding) technique, and then transmits the 
statistically coded data of the set of quantized transform 
coefficients to the multiplexor 19, which also receives 
the encoded vertex data from the vertex coder 17. The 
multiplexor 19 provides the statistically coded data and 
the encoded vertex data to a transmrtter(not shown) for 
the transmission thereof. 

As described above, in accordance with the present 
invention statistically coded data of a set of errors for s 
contour segment is adaptively transmitted depending 
on whether an estimated approximation error between 
the contour segment and its corresponding line seg- 
ment is greater than the threshold TH'. If the approxima- 
tion error is not greater than the threshold TH', 
representing that the approximation error is not consid- 
erable, only the vertices of the line segment are 
encoded and transmitted to a decoder of the receiving 
end; and the contour segment is approximately therein 
by the line segment. By carrying out the encoding of the 
contour, the amount of transmission data can be econo- 
mized while maintaining the picture quality. 

In another preferred embodiment of the invention, 
the contour encoder 20 of Fig. 1 can be constituted in 
such a way that the switching block 15 is positioned 
between the error detection block 12 and the transfer 
block 13. In such an instance, the transfer of the set of 
errors from the error detection block 12 to the transfer 
block 13 is controlled by a selection signal from the deci- 
sion block 16. 

While the present invention has been described 
with respect to certain preferred embodiments only, 
other modifications and variations may be made without 
departing from the scope of the present invention as set 
forth in the following claims. 

Claims 

1. A method for encoding a video signal of a contour 
of an object, comprising the steps of: 

dividing the contour into a multiplicity of contour 
segments, and approximating each contour 
segment by a line segment joining two end 
points of said each contour segment, thereby 
determining positions of vertices on the con- 
tour; 

encoding the positions of the vertices on the 
contour; 

calculating a set of errors for said each contour 
segment, wherein the set of errors represents a 
difference between said each contour segment 
and the line segment corresponding thereto; 
coding the set of errors to provide a set of 
coded errors; 

estimating an approximation error between 
said each contour segment and the line seg- 
ment corresponding thereto; 
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if the approximation error is larger than a 
threshold value TH, providing the set of coded 
errors and the set of encoded positions of the 
vertices on the contour as encoded data for the 
contour; and 5 
if the approximation error is smaller than or 
equal to the threshold value TH, providing the 
set of the encoded positions of the vertices on 
the contour as encoded data for the contour. 

10 

2. The method of claim 1 , wherein the approximation 
error is a length of the line segment. 

3. The method of claim 1 , wherein the approximation 
error is a value obtained by dividing an area is 
between said each contour segment and the line 
segment corresponding thereto by a length of the 
line segment. 

4. The method of claim 1 , wherein the approximation 20 
error is a value obtained by dividing the number of 
pixels surrounded by the line segment and said 
each contour segment by a length of the line seg- 
ment 

25 

5. The method of claim 1 , wherein the approximation 
error is a value obtained by dividing a distance from 
the line segment to the farthest point on said each 
contour segment by a length of the line segment. 

30 

6. The method of claim 1 , wherein the approximation 
error is a value obtained by dividing the sum of dis- 
tances from a sample point lying between said each 
contour segment and a plurality of points on the line 
segment by a length of the line segment. 35 

7. The method of claim 6, wherein the plurality of 
points are equi-distanced from each other. 

8. An apparatus for encoding a video signal of a con- 40 
tour of an object comprising: 

means for polygonal -approximating the con- 
tour, wherein the contour is divided into a mul- 
tiplicity of contour segments, and each contour 45 
segment is approximated by a line segment 
joining two end points of the contour segment, 
thereby determining positions of vertices on 
the contour; 

means for encoding the positions of the verti- so 
ces on the contour; 

means for calculating a set of errors for each 
contour segment, wherein the set of errors rep- 
resents a difference between each contour 
segment and the line segment corresponding 55 
thereto; 

means for estimating an approximation error 
between each contour segment and the line 



segment corresponding thereto; 

means for comparing the approximation error 

with a threshold TH to thereby generate a 

selection signal when the approximation error 

is greater than the threshold TH; and 

means for coding the set of errors in response 

to the selection signal to provide a set of coded 

errors. 

9. The apparatus of claim 8, wherein said approxima- 
tion error is a length of the line segment 

10. The apparatus of claim 8, wherein said approxima- 
tion error corresponds to an area confined by each 
contour segment and the line segment divided by a 
length of the line segment. 

11. The apparatus of claim 8, wherein said approxima- 
tion error corresponds to the number of pixels posi- 
tioned inside each contour segment and the line 
segment divided by a length of the line segment. 

12. The apparatus of claim 8, wherein said approxima- 
tion error is defined by the distance from the line 
segment to the farthest point on each contour seg- 
ment divided by a length of the line segment. 

13. The apparatus of claim 8, wherein said approxima- 
tion error is the sum of the distances from sample 
points taken on the line segment by a length of the 
line segment 

14. The apparatus of claim 13, wherein said sample 
points are equi-distanced from each other. 

15. The method for encoding a contour of an object, 
comprising the steps of: 

(a) partitioning the contour into a multiplicity of 
contour segments by a plurality of vertices, two 
end points of each contour being vertices: 

(b) estimating an approximation error between 
each contour segment and a line segment join 
two vertices of each contour segment; 

(c) comparing the approximation error with a 
preset threshold TH; 

(d) if an approximation error for a contour seg- 
ment is greater than the threshold TH, calculat- 
ing a set of errors between the contour 
segment and a line segment corresponding 
thereto; and 

(e) encoding position information of the verti- 
ces and a set of errors for each contour seg- 
ment having an approximation error greater 
than the threshold TH. 

16. The method according to claim 15, wherein said 
approximation error is represented by one of a 
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value selected from the group consisting of (a) a 
length of the line segment, (b) an area between 
each contour segment and the line segment divided 
by the length of the line segment, and (c) the 
number of pixels confined by each contour segment $ 
and the line segment divided by the length of the 
line segment. 

17. The method according to claim 15, wherein said 
approximation error is represented by a maximum w 
distance between each contour segment and the 
line segment divided by a length of the line seg- 
ment. 

18. The method according to claim 15, wherein said is 
calculating step includes the steps of: 

(a1) determining a predetermined number of 
sample points on the line segment; and 
(a2) finding a displacement between each 20 
sample point and an intersection formed by 
said each contour segment and a line normal 
to the line segment at each sample point. 
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(54) Method and apparatus for encoding a contour of an object 



(57) A method for encoding a video signal of a con- 
tour of an object, which is capable of reducing the 
amount of transmission data through selectively trans- 
forming the approximation errors, is disclosed. The 
method divides the contour into a multiplicity of contour 
segments, and approximates each of the contour seg- 
ments by a line segment, thereby determining positions 
of vertices on the contour. Then, it calculates a set of 
errors between the contour segment and the line seg- 
ment, wherein the set of errors is a difference between 
each contour segment and the line segment corre- 
sponding thereto, and then calculates a value for deter- 



mining the difference between the contour segment and 
the line segment. Thereafter, the method encodes and 
generates the positions of the vertices on the contour 
and the set of errors if the calculated value for determin- 
ing the difference between the contour segment and the 
line segment is larger than a threshold value TH. Alter- 
natively, if the calculated value is smaller than the 
threshold value TH, the method encodes and generates 
only the positions of the vertices on the contour, thereby 
reducing the amount of transmission data. 
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